The Effectiveness of Results Re-Ranking and Query Expansion in Cross-language Information Retrieval

نویسندگان

  • Dong Zhou
  • Vincent P. Wade
چکیده

This paper presents the technique details and experimental results of the information retrieval system with which we participated at the NTCIR-8 ACLIA (Advanced Cross-language Information Access) IR4QA (Information Retrieval for Question Answering) task. Document corpus in Simplified Chinese (CS) and Traditional Chinese (CT) with topics in English, CS and CT were used in our experiments. We combined the query expansion and initial retrieval results re-ranking techniques as main retrieval approach. The experimental results confirmed that query expansion based on Bose-Einstein distribution and re-ranking method based on Latent Dirichlet Allocation (LDA) are able to consistently bring significant improvements over various baseline systems. Especially the approach is capable of processing mixedmultilingual text obtained by a machine translator for crosslanguage information retrieval (CLIR). The results obtained might provide us more insight and understanding into cross-language query expansion and document re-ranking.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

QEA: A New Systematic and Comprehensive Classification of Query Expansion Approaches

A major problem in information retrieval is the difficulty to define the information needs of user and on the other hand, when user offers your query there is a vast amount of information to retrieval. Different methods , therefore, have been suggested for query expansion which concerned with reconfiguring of query by increasing efficiency and improving the criterion accuracy in the information...

متن کامل

A Sequential Frequent Pattern Mining Framework for Personalized Xml Retrieval

With the huge development of internet, the information retrieval became tough and unreliable. Users interest and need is differs at every time. In order to improve the searching experience, several personalized search techniques are proposed. Using the information about the user, their history and query behavior the results will be reproduced. This kind of query reproduction is known as persona...

متن کامل

Term Similarity-Based Query Expansion for Cross-Language Information Retrieval

We propose a query expansion technique which is based on a statistical similarity measure among terms to improve the effectiveness of the dictionary-based cross-language information retrieval (CLIR) method. We employ a term similarity-based sense disambiguation technique proposed in our earlier work to enhance the accuracy of the dictionary-based query translation method. The query expansion te...

متن کامل

Cross-Language Information Retrieval via Hybrid Combination of Query Expansion Techniques

This paper describes a new approach in Cross-Language Information Retrieval that combines query expansion techniques before and after query translation and disambiguation. Moreover, a new technique based on domain keywords extraction is proposed. Test results showed the effectiveness of the combined method.

متن کامل

University of Chicago at CLEF2004: Cross-language Text and Spoken Document Retrieval

The University of Chicago participated in the Cross-Language Evaluation Forum 2004 (CLEF2004) cross-language multilingual, bilingual, and spoken language tracks. Cross-language experiments focused on meeting the challenges of new languages with freely available resources. We found that modest e ectiveness could be achieved with the additional application of pseudo-relevance feedback to overcome...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010